Towards Robust Visual Information Extraction in Real World: New Dataset and Novel Solution

نویسندگان

چکیده

Visual Information Extraction (VIE) has attracted considerable attention recently owing to its various advanced applications such as document understanding, automatic marking and intelligent education. Most existing works decoupled this problem into several independent sub-tasks of text spotting (text detection recognition) information extraction, which completely ignored the high correlation among them during optimization. In paper, we propose a robust System (VIES) towards real-world scenarios, is an unified end-to-end trainable framework for simultaneous detection, recognition extraction by taking single image input outputting structured information. Specifically, branch collects abundant visual semantic representations from multimodal feature fusion conversely, provides higher-level clues contribute optimization spotting. Moreover, regarding shortage public benchmarks, construct fully-annotated dataset called EPHOIE (https://github.com/HCIILAB/EPHOIE), first Chinese benchmark both extraction. consists 1,494 images examination paper head with complex layouts background, including total 15,771 handwritten or printed instances. Compared state-of-the-art methods, our VIES shows significant superior performance on achieves 9.01% F-score gain widely used SROIE under scenario.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The AdobeIndoorNav Dataset: Towards Deep Reinforcement Learning based Real-world Indoor Robot Visual Navigation

Deep reinforcement learning (DRL) demonstrates its potential in learning a model-free navigation policy for robot visual navigation. However, the data-demanding algorithm relies on a large number of navigation trajectories in training. Existing datasets supporting training such robot navigation algorithms consist of either 3D synthetic scenes or reconstructed scenes. Synthetic data suffers from...

متن کامل

Towards robust data association in real-time visual SLAM

Recent years have seen the emergence of systems capable of tracking in real-time the 6-D pose of a moving camera whilst simultaneously building a structural map of the surrounding environment. Such vision based simultaneous localisation and mapping (SLAM) systems have huge potential in terms of providing low cost and flexible 3-D location sensing, capable of operating with agile hand held devic...

متن کامل

Satisfying Real-world Goals with Dataset Constraints

The goal of minimizing misclassification error on a training set is often just one of several real-world goals that might be defined on different datasets. For example, one may require a classifier to also make positive predictions at some specified rate for some subpopulation (fairness), or to achieve a specified empirical recall. Other real-world goals include reducing churn with respect to a...

متن کامل

LearningPinocchio: adaptive information extraction for real world applications

The new frontier of research on Information Extraction from texts is portability without any knowledge of Natural Language Processing. The market potential is very large in principle, provided that a suitable easy-to-use and effective methodology is provided. In this paper we describe LearningPinocchio, a system for adaptive Information Extraction from texts that is having good commercial and s...

متن کامل

Real - World Visual Computing

Over the last decade, the tremendous increase in computational power of graphics hardware, in conjunction with equally improved rendering algorithms, have led to the situation today where real-time visual realism is computationally attainable on almost any PC, if only the digital models to be rendered were sufficiently detailed and realistic. With rapidly advancing rendering capabilities, the m...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence

سال: 2021

ISSN: ['2159-5399', '2374-3468']

DOI: https://doi.org/10.1609/aaai.v35i4.16378